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Exhibit Bi to Accompany Declaration under 
^^^^ §1.131 for US Appln. No. 10/074,527 

Sec3uendier5« ^33945" 



fbhX33945phg01 b1 .a bi. 1 to 268 
fbhX33945phh01b1.abi. 1 to 361 
tbh X33945pgbb02b1.abi. 53 to 422 
f bhX 33945p9ba0 1 a1 abi . 53 to 424 
fbhX 3394Spgba02b1.abi. 53 t o 424 
fbhX 3394Spgbb01a1.abi. 65 to 424 
fbhX339 45phc01 a1 .abi. 1 39 to 424 
lbhX33945phd01a1.abi, 146 to 424 



AL136084.nt |GENSgANjrcdic t9d,C^ 273 to » . ^^'^^ 

^ AI8 6 3 865 in DB Est. 278 to 779 ^ \ ^ VC^ 

AA49318 7 in DBE st. 313 to 645 * ^1 vf , ^ 

AA4^9394 in DBEst. 317 to 779 ^"^^ 



At800923 in DBEst. 321 to 8, 



AA836046 in DBEst, 570 



AL1 36064.nt|GEN SCAN^redlcte .jL^^^i^ [ U*''^' 

AA401053I k \\A>f'^ ^(2^. 



AA429393 ii 
AL136084.nl 



....ctot, 1255 to 1j5 52 
»\Ll360 84.nt|GENSCAN_predicted_C, 1293 to 1826 
jThzcl 1 57a 07t1 .abi. 1 30 9 to 1477 

Cbhne001h07jot1.ab(, 1360 to 1604 



;ohne001h07l1.abi. 1360 to 1855 



AC007800.n tJGENSCAN_predicted_C. 1 4 23 to 1826 
cM hqad053c04a1 .abi. 1547 to 1 968 

cMhv f090g07a1.abi. 1652 to 2009 
AJ638649 in DBEst - Import > c. 1660 to 21 1| 
AA554045 in D BEst - Import - c. 16B5 to 2119 
AI916034 in DB Est - Import - c. 1716 to 2119 
jlhbaa 033c02t1.abi. 1719 to 2119 
johnd06 8h12t1.abi. 1753 to 20 83 
AI636959 In DBEst ^ Import - c. 1753 to 2128^ 
AA994913 in DBEst -^Import - c. 1758 to 2115 
cohvBAOOIelOal.abi. 1853 to 2009 




□ Holeincontig DiagraniKey 
^ Single fragment ma y*"" "^^v . 
@ Multiple fragments same direction 
S Both strands 
■ Both strands plus 



jybndav, Nbveniber 27. 2000 



Bumps on 
fragments 
shov motifs, 

Knilnw 
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Sequence Information 
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MI:fbhX33945pee03g1 (Human) 

Description: Sample Information 

Name: fbhX33945pee03g1 
Type: Full Length cDNA 
Submitted on: Dec 12, 2000 
NC project: Bayer 
Species: Human 
Tissue: unspecified 
Internal id: 107050574 



Sequence Information 
Creation metliod: PhredBaseCall 
Name: PHRED Base Calling Event 
Created on: Dec 14, 2000 
Created by: Pipeline 
Accession number: (none) 

Creation method: ABIBaseCall 
Name: ABI Base Calling Event 
Created on: Dec 14, 2000 
Created by: Pipeline 
Accession number: 107241068 

Analyses: Note: Excludes periodic reanalysis events. 



Description 


Date 


Requested by 


Sts 


Internal Db Check' > 


Dec14,2000 


Pipeline 


YE 


CUR Mildb Check 


Dec15,2000 


Pipeline 


YE 


Nucleic Acid Db Check 


Decie, 2000 


Pipeline 


YE 


Protein Db Check 


Dec16,2000 . 


Pipeline 


YE 


DBEST Db Check 


Dec16, 2000 


Pipeline 


YE 


PDB Db Check 


Dec 16, 2000 


Pipeline 


YE 


Signal Peptide Check 


Dec 16, 2000 


Pipeline 


YE 


1.4 BLAST vs. PNU 


May 28, 2004 


Olga Tayber 


SU 


1.4 BLAST vs. NUC 


May 28, 2004 


Olga Tayber 


SU 



Base Array: pHRED Base Call - complete 

Color and case key: NORMAL, low quality, vector, repeat 

Ttgggtaccgggccccccctcgaggtcgacggtatcgataagcttgatatcgaattcctg 
cagcccgggggatccgcccgcgcagaTCGCTGGCTGCAGTTGGCGGGCGGATGTGGGGGC 
GCACGGCGCGGCGGCGCTGCCCGCGGGAACTGCGGCGCGGCCGGGAGGCGCTGTTGGTGC 
' ' TCCTGGCGCTACTGGCGTTGGCCGGGCTGGGCTCGGTGCTGCGGGCGCAGCGTGGGGCCG 
GGGCCGGGGCTGCCGAGCCGGGACCCCCGCGCACCCCGCGCCCCGGGCGGCGCGAGCCGG 



http://informalics.mpi.com:8094/aw/AnalvsisWorkbenchServlet?function=getSinRleView 6/10/2004 



Sequence Information 
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TCATGCCGCGGCCGCCGGTGCCGGCGAACGCGCTGGGCGCGCGGGGCGAGGCGGTGCGGC 
TGCAGCTGCAGGGCGAGGAGCTGCGGCTGCAGGAGGAGAGCGTGCGGCTGCACCAGATTA 
ACATCTACCTCAGCGACCGCATCTCACTGCACCGgggctagagcggccgccaccgcggtg 
gagctccagcttttgttccctttagtgagggttaattgcgcgcttggcgtaatcatggtn 



Comments: 



Author 



Date 



Comment 



Exhibit B3 to Accompany Declaration under 
33945 37 CFR §1.131 for US Appln. No. 10/074,527 

Secjuendher*" ^^33945" 



ibhX3394Sptig01b1.abl. 1 to 2M 




|. 273 ta 621 
AiggMS in DBEW. 278 tt> 779 ^ 
AAWia ? h OBEi l. ai3 la^«45 

MA Om In DBErt. 3»7 10 778 
^ AI800923 h OBErt. 321 to B74 



AK024869 In GenBuk. S08 to 28S0 



AL13C084.iU|GE NSCAN_|>rw«cte d,C. 622 to 612 

^AA40tOS3 In OBEst. 672 to 1146^ 
AM29393 In OBEst. 675 to 1 148 



AL136 064.nttGENSCAN.j)fadictad_C. atO to 1293 
flEt67242 Jn DSesl. 101 4 to 1 190 

AWS43782 hi DBEst - Import • c. 1 156 to 1301 



AWBt4059 In TOEst - Import -j i. 1227 to 1447 
|ll twh204g03t1^ 1249 to 1643 
tl hAa158a12t1.aM. 1255 to 1M2 
AL13 «»4.ntlGENSCAN_jyt<gcW<LC. 12»3 to 1826 
frittCl157 a07t1.aW. 130 9 to 1477 

cti(w00 ^l>07|o».1ii. 136^ 10 1604 

|0hnaQ01h07t1.1bl, 1360 to 18S5 



AC007eOO .BHSEMSCAKjfidlctod.C. 142 3 10 1626 
. 1547 te^1W8 



cMl wtOOOg07a1 



Jitll. 1652 to 2009 



AA5S4 q45 In D8EW • Import • c. 1665 to 21 19 
AI916034 In OBEtt . Import - e. 1716 to 2119 



|Ohn d06Bh1211.at)l. 1753 to 2 0B3 
M6368S9 h DBEst - Import - c 1753 to 2128 



cotivBAOOIalOal.abt, 18S3 to 2009 



1 138 



428 



668 816 1.014 



1.386 1.650 1.852 



2.132 2.850* 



99 W 



III II 



I II ill III I III 1 ^ fi II 1 1 II ' i " I TF'ii 



ill TilT ' 



-> — t- 



II 11 I I I I II 



I I I II 



i l l I I mil 



Multiple fragments same direction 
Both strands 
Both strands plus 
r ■ 'iY'i 'r- »>^stirt oodon firame 1 



. .fitAn AftHnn franrwk 9 



Monday, Deoariber 18, 2000 



Bumps on 
fragments 
shov motifs^ 
hollow 
rectangles 
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fi Amp 1ft TTifoiniat- jftn 

name: Fbh33945FL 
type; Plain DNA Sequence 
submitted on: December 18 2000 
NC project: Bayer 
species: Human 
tissue: unspecified 
internal id: 107445732 
restrictions: none 

creation method: Loading 
name : Load 

created on: December 18 2000 
created by: Peter Olandt 
accession miniber: 107445733 



> Fbli33945FI< - Import - coiqplete 



1 


CCGGCTCGGT 


61 


TGGCTGCAGT 


121 


TGCGGCGCGG 


181 


GCTCGGTGCT 


241 


GCACCCCGCG 


301 


CGCTGGGCGC 


361 


AGGAGGAGAG 


421 


ACCGCCGCCT 


481 


ATTTGCCCAG 


541 


GGACAGTTTA 


601 


TAGATGACTA 


661 


TGCCCAAGGT 


721 


TGGGGGCGTC 


781 


ACGAAGGGTG 


841 


GCCCGGTGAT 


901 


CCCAGATCGG 


961 


AGAGGATACG 


1021 


TGTTTGCTGT 


1081 


TTTGGGGAGG 


1141 


AAACACACCC 


1201 


ACAAGGCTCT 


1261 


TCTACTACCA 


1321 


AGCAGCTCCG 


1381 


CAGAACTGCA 


1441 


TAACAGACTA 


1501 


TCATTCTGTA 



Exhibit B4 to Accompany Declaration under 
37 CFR §1.131 for US Appln. No. 10/074,527 



ACCACTATAA CGGCCGCCAG TGTGCTGGAA TTCGCCCTTG CGCAGATCGC 
TGGCGGGCGC ATGTGGGGGC GCACGGCGCG GCGGCGCTGC CCGCGGGAAC 
CCGGGAGGCG CTGTTGGTGC TCCTGGCGCT ACTGGCGTTG GCCGGGCTGG 
GCGGGCGCAG CX5TGGGGCCG GGGCCGGGGC TGCCGAGCCG GGACCCCCGC 
CCCCGGGCGG CGCGAGCCGG TCATGCCGCG GCCGCCGGTG CCGGCGAACG 
GCGGGGCGAG GCGGTGCGGC TGCAGCTGCA GGGCGAGGAG CTGCGGCTGC 
CGTGCGGCTG CACCAGATTA ACATCTACCT CAGCGACCGC ATCTCACTGC 
GCCCGAGCGC TGGAACCCGC TGTGCAAAGA GAAGA/^TAT GATTATGATA 
GACATCTGTT ATCATAGCAT TTTATAATGA AGCCTGGTCA ACTCTCCTTC 
CAGTGTCCTT GAGACATCCC CGGATATCCT 6CTAGAAGAA GTGATCCTTG 
CAGTGATAGA GAGCACCTGA AGGAGCGCTT GGCCAATGAG CTTTCGGGAC 
GCGCCTGATC CGCGCCAACA AGAGAGAGGG CCTGGTGCGA GCCCGGCTGC 
TGCGGCGAGG GGCGATGTTC TGACCTTCCT GGACTGTCAC TGTGAGTGCC 
GCTGGAGCCG CTGCTGCAGA GGATCCATGA AGAGGAGTCG GCAGTGGTGT 
TGATGTGATC GACTGGAACA CCTTCGAATA CCTGGGGAAC TCCGGGGAGC 
CGGTTTCX3AC TGGAGGCTGG TGTTCACGTG GCACACAGTT CCTGAGAGGG 
GATGCAATCC CCCGTCGATG TCATCAGGTC TCCAACAATG GCTGGTGGGC 
GAGTAAGAAA TATTTTGAAT ATCTGGGGTC TTATGATACA GGAATGGAAG 
AGAAAACCTC GAATTTTCCT TTAGGATCTG GCAGTGTGGT GGGGTTCTGG 
ATGTTCCCAT GTTGGCCATG TTTTCCCCAA GCAAGCTCCC TACTCCCGCA 
GGCCAACAGT GTTCGTGCAG CTGAAGTATG GATGGATGAA TTTAAAGAGC 
TCGCAACCCC CGTGCCCGCT TGGAACCTTT TGGGGATGTG ACAGAGAGGA 
GGACAAGCTC CAGTGTAAAG ACTTCAAGTG GTTCTTGGAG ACTGTGTATC 
TGTGCCTGAG GACAGGCCTG GCTTCTTCGG GATGCTCCAG AACAAAGGAC 
CTGCTTTGAC TATAACCCTC CCGATGAAAA CCAGATTGTG GGACACCAGG 
CCTCTGTCAT GGGATGGGCC AGAATCAGTT TTTCGAGTAC ACGTCCCAGA 



1561 


AAGAAATACG 


CTATAACACC 


CACCAGCCTG 


AGGGCTGCAT 


TGCTGTGGAA 


GCAGGAATGG 


1621 


ATACCCTTAT 


CATGCATCTC 


TGCGAAGAAA 


CTGCCCCAGA 


GAATCAGAAG 


TTCATCTTGC 


1681 


AGGAGGATGG 


ATCTTTATTT 


CACGAACAGT 


CCAAGAAAT6 


TGTCCAGGCT 


GCGAGGAAGG 


1741 


AGTCGAGTGA 


CAGTTTCGTT 


CCACTCTTAC 


GAGACTGCAC 


CAACTCGGAT 


CATCAGAAAT 


1801 


GGTTCTTCAA 


AGAGCGCATG 


TTATGAAGCC 


TCGTGTATCA 


AGGAGCCCAT 


CGAAGGAGAC 


1861 


TGTGGAGCCA 


GGACTCTGCC 


CAACAAAGAC 


TTAGCTAAGC 


AGTGACCAGA 


ACCCACCAAA 


1921 


AACTAGGCTG 


CATTGCTTTG 


AAGAGGCAAT 


CATTTTGCCA 


TTTGTGAAAG 


TTGTGTTGGA 


1981 


TTTAGTAAAA 


ATGTGTU^TAA 


GCTTTGTACT 


TATTTTGAGA 


ACTTTTTAAA 


TGTTCCAAAA 


2041 


. TACCCTATTT 


TCAAAGGGTA 


ATCGTAAGAT 


GTTAACCCTT 


GGTATTTAGA 


AAATTAAAAC 


2101 


CTTATAATAT 


TTTTCTATCA 


ARAWRWAWAT 


TTTACAGTCG 


TGCCTTTTAC 


TCTCATTAGC 


2161 


AAAAAAGATA 


AAGATTTTAT 


TTTCGTATTT 


ACAAGAATTC 


CCAGGTACGA 


AGATATCTGC 


2221 


ATGGGTGGAA 


ATCAGGTTCA 


AGCAACGTAC 


TTTGCATTAA 


CTGATAATAC 


CTCAGCTGCG 


2281 


GGGTTAAAGT 


TTTCCCAGTA 


TAGAGAGACT 


GTCACTAGGA 


ACATTGTATT 


GATTTATTCA 


2341 


GGTCATTGAG 


ATCTTCTAGA 


TGTATTTTAA 


AAAGAATGCT 


TTTTGGTTAT 


GT6TTGCTAC 


2401 


CACAGTTAAC 


ACTCCATAAT 


GTTCATGTCA 


GCCAAAGAGG 


ACTAACCAAA 


GCTGAAATCT 


2461 


CAGAGAACAA 


TTTGCTTTAC 


TAAGCTGAGT 


CAACTTGAGA 


GCGAACTTCT 


AACAATGCCG 


2521 


CACTGTAGTG 


TGGCTGGTTC 


TACCACTATG 


ACTTTAAAAC 


ATGTTTATAT 


CATTTTTAAT 


2581 


TTTTATGATA 


CGGTAGTGTC 


AGGGAGAAAT 


GTAATGTTCT 


ATATGAAATT 


CCTTTTTCAA 


2641 


GTTTGTTCAT 


TAATAACAGT 


TATTAATTTA 


AATCAGCGTT 


AGAGTTTGTG 


CTGCTGCAAC 


2701 


TGCTGTGAAA 


ATTTCTCTGA 


GTAATTCTGA 


TTTGTGAATG 


ATCCCAGACC 


AACCCTGAGA 


2761 


TTTTGTCAAC 


CTGATTAAGT 


CAATATGAAT 


GATTAAAAAG 


ATGTGAGAAC 


AAAAAAAAAA 


2821 


AAAAAAAAAA 


AAAAAAAAAA 


AAAAAAAAAA 









Back 10 orfanalxei 

Analysis of 33945 (581 aa) 
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I I I I I I I 



I ' { ' J ' I • I • I ' I ' f • ! M ' I ' I ' I ' I ' I ' ! ' I ' I ' I • ) ' I ' I ' I ' I ' I ' I ' I • i M ' i 
1 41 Bl 121 161 201 241 281 321 361 401 441 481 521 561 



l««CRTAIUUlCPREXRRGREALLVLUUJlUiCLCSVLJUQRCAGAGAAEt>CPFRTPRPGR 

REPVMPRFPVPANAUIARCEAVHL0M?GEELRU5EE:SVRLi^INIYLSDRlSL.HRflLPER 

tlHPU:KEXKYirrim.PRTSVI I JUnrNEKWSTUJtTVnrSVUCTSPDIUiEEVILVOOYm 

DILKEIOJUTELSGLPKVRLIMNKRBGLVRAIlLtAASMKUIVb 

IWRIIffiEESAVVCPVIDVIIXfMTFEnriiGMSCEPQlOCFOimLVITWHTVPERE^ 

PVDVIRSPT1«AGGLFAVSncyPEYU>SYCm»EVnK»E9fLEPSPRIW0COG 

VCHVFPKOAPySIQIMLMISVRMEVMMDITKSLYYHntmU^ 

<X:iCPF»CPIXTVCTELBVPEORPOPiqiI^ I B a. T U V C HW^ 

CHBONQFPBrrSQKEIimnilQPIXXXAVEKCMin'LIIUIfEETAPENOKF 



PSORT Prediction of Protein Localization 

HITDISC: discrimination of ultoctiondrial targetino seq 
R content: 8 Hyd Homnt |7S) : 13.73 

Hyd Moment (95) : l€.6S Ccontant: 2 
D/E content: 2* 8/T content t 1 

Scor«: 1.S5 

Gavel: prediction of claavaga slcea for mitochondrial prescq 
R-2 motif at 19 RRClPR 

MUCmsC: diacrlai nation of nuclear localization signals 
pat4: none 

I>at7: PLCKEKR (9) at 133 
bipartite: none 

content of basic residues: 12. 9t 
NLS Score: -0.22 

ER Henbrane Retention Signals: 

XXRR-like Botif in the H-temlnus: WCRT 



XXXX-like Botlf in the C-teradnust KERM 

Final Results (k - 9/23): 

43.5 %: mitochondrial 
30.4 t: cytoplasaiic 

8.7 I: vacuolar 

8.7 %: nuAear 

4.3 %: Golgi 

4.3 %; endoplasBlc reticulum 

prediction for 33945 is aSt (k-23) 



SUrt 


End 


Feature 


1 s„ 1 


86 


107 


Leucine ripper pattern (PS00Q29) 


1 LQGEELRLQE... VRLHQINIYL | 



Signal Peptide Predictions for 33945 



Method 



I Predict 



SignalP (eukaryote) | YES 



Score|Mat@l 



il TOas ttted tar fisnsi pcpUdc prcdlcUan 



Transmembrane Segments Predicted by MEMSAT 



Start 


End 


Orient 


Score] 


20 


37™ 


mi->ins 


43 J 



>33945 

MWSRTARRRC PRELRRGREALLV tlA LJWLGSVLRAORCAajyBAAEPOPPRTPRFGR 
REPVMPRPPVPAr»LGAHGEAVRLQLQGEELRLQEESVRLH01»n YLSnRl SLHRRLPER 
WPUrKEKKYDYIMLPRTSVI IAFYNEMfSTUJl1VySVLeTSPDIU<EEVILV0DySDR 
EHUCERZAHEbSGLPKVRLIRAtnWBGLVRARLLGASMUVS)^ 



LLORIHEBESAWCPVI Wir»IMTTm<WSCEPgiOCraiHIttWPniHTVPEREBlFH0S 
PVWXKSPTlAO(n.FAVSiatYrEYIiGSyDT(aiEVHaGEMLEPSFRINQCO0VLEZHPCSH 
VQH VFPI CQiMYBFWKMJUISVIUUaVWHDEFMXTYBIWPW 

QCIUlPKHFtElVyPBLtlVPEDItKITOHLOHnQLTDyCPinrNPPDOIOIVCHQVI LYLCH 
GMBOMOPFEYTSOKEIRYNTKOPBOCXAVEACHDTLIMHLCeCTAPOlQKPILOEDGSLP 
HEOSIWCVOAXRKESSDSFVPLUUXrmSDHQKKFFKERML 

Transmembrane segments for presumed mature peptide 



Prosite Pattern Matches for 33945 

Prastic vtrslMi: RcIcmc 12.3 wt Pcknury 1995 

>ESmfl4|PDOC00004|CAMP^PHOSPHO_SJTB CAMP- aiuS eGHP-d«p«fMtent protein kinM* phosphoxylation sic*. 

Query: 552 RKES 555 

>E££iaa5jPO0C00005|PKC«PHOSPHO_slTE Protein kinss* C pho^^rylation site. 



Query: 


5 


TAR 


7 


Oueryt 


SS 


TPR 


57 


Query: 


97 


SVR 


99 


Query: 


106 


SDR 


110 


Query: 


178 


SDR 


180 


Query: 


318 


SIOC 


320 


Query: 


343 


SFR 


3«S 


Qoery: 


380 


SVR 


382 


Query: 


411 


TER 


413 


Query: 


492 


SOK 


494 


Query: 


544 


SKK 


54 S 



>£S£222L|PDOC00006|CK2_PHOSPHO_siTB Casein kinase ii phosstborylatlon sit*. 



Query: 


15B 


SVLE 


161 


Query: 


162 


TSPD 


165 


Query: 


178 


SDRE 


181 


Query: 


225 


TFU3 


220 


Query: 


289 


TVPE 


292 


Query: 


300 


SPVD 


303 


Query: 


330 


TGHE 


333 


Query: 


492 


SOKE 


495 


Query: 


524 


TAPE 


527 


Query: 


567 


THSD 


570 



>Ea!ai£Q2|PDOC00007tTYn^PHOSPHO_SlTe Tyrosine kinase phosphorylation site. 

Query: 426 KWFLETVY 433 

>£Sgfl,Ci'C B I PDOCO 000 9 1 MV RISTYL H-myristoylation site. 



Query: 


42 


GACAGA 


47 


Query: 


207 


GLVFAR 


212 


Query: 


215 


GASAAR 


220 


Query: 


312 


GGLFAV 


317 


Query: 


326 


GSYDTG 


331 


Qqcxy: 


351 


GVLETH 


356 


Query: 


447 


GMLQNK 


452 


Qoery: 


461 




486 



>£Sfii|£fiS|PDOC00OD9|AiaDATI0N Aaldation site. . 
Query: 58 PGRR 61 

>PSQCOl R t ^DOCQOO 1 I unn cell attactnent sequence. 
Query: 220 RGD 222 



>esaiUi2SlrooC00029|uniCI1«e_ZXPPER Lwicln* lipptr pattern. 
Omzy: 86 LQOEELftLOEESVRLHOnilYI' 107 



7 

Protein Family / Domain Matches, HMMer version 2 

Searching for conipI»t« dostains in P7AM 

hnnpfan - search a single saq against WM «Satabas» 

HMIER 2.1.1 (Dec 1996) 

Copyright IC) 1992-1998 Washington Univtrsity School of Mdicine 
HMKER Is freely distritwtcd under. the GNU General Public License IGPLI . 

Hm file; /prod/<]d»/seqanal/PFAM/pfMS.S/Ptaai 
sequence file: /prod/dtte/Hmsee/orfanal/oa-Berlpt. 29918. seq 



: 8e<iuence family classification 4score includes all donains) : 

Description Score B- value H 



Qlycosyl transferases 85.1 1.4e-3 

R«ftn R i*rf<a Similarity to lectin donain of ricin 18. '7 0.001 

Parsed for doaalns: 

Model Domain seq-f seq-t lan-f Imra-t score B-value 



Glycos transf 2 1/1 139 322 . . 1 187 |) 8S.1 1.4e-21 

Rlcln_B_lectin 1/1 441 577 . , 1 14S fl 18.1 0.0018 

Alignments of top-scoring domains: 

Olyeo*. transf. 2 t domain 1 of 1. froa 139 to 332: score 85.1, E * t.4e~21 
♦ ->si vIptYNeeadyLeell esvl aqs . 1 1 edi e i i wDDgSe cD« t v« 
s**I* YNe*-»**L«**» Svl* B** * e+i vDO S ♦ ♦♦«! 
3394S 139 SVXIAPyNEKKSTLUmnrSVLETSpDILLEEVILVDDYSDREHLKE 185 

iaedylder i keenprl i ivi rlecnaqCpaaAr AkgirratodsdyZ 1 f 

♦ ♦ i ♦ P* ♦♦♦ir 0 ♦♦Ar ♦g ♦♦a-»Od 

33945 188 RLANEL SGLFK-VRLIRANKRE-GI.VRAKLLCASAASGD--VLTF 226 

LOaDdi f tpdklekl IdyasatdaAwlgaida yeyaegesnly . 

IjD****** ♦♦ie-»l*+*+ ♦ aw*-» id* ♦ ♦♦♦ey ♦ ♦ ♦ 
33945 227 LDCHCECHBCWLEPLLQRIHEEESAWCPVIDVidwntFEYLGNSGEPOi 276 

riaradterslf aoJJrktgrlcgglelsfeiosnaiyrreaf eel f <-• 
♦ ♦ ♦ ** K* *x •» ■•♦♦**a* ♦ ♦ ♦♦♦f 
33945 277 O0fTII«LVFTWITVPEREIUW«SP-VWlRSPTt«CGLFAVSHCyF 322 

Klelm_a_leetiat domain 1 of 1. from 441 to 577: score 10.7, E - 0.0016 
•->ravfvitil .vngnsgrClDvnsssesdS. .nqvqlwaChsnpgknQ 
* l* g «4 C D n ■» qv 1 Ch* *g nQ 

33945 441 DltPQFFQMUtfaCCLt1)yCFDyMPPDEN01vgI10VILyU»6-HSO^ 466 

Mffsl t Ydesdgei r b . vvnnd)cCLtvnanqpgseVkl yqCdsat sdnQMl 

♦ -ft s ♦eir * * *C ♦V'te ♦ ♦ ♦ C *♦ *nOk* 
33945 487 FFEYT— SQKEZRynTHOPEBCIAVEAGKDT— LIMHLCEETAPBIDKF 531 

elnndgl ignki 1 Inl vntglvLdvkgadtqngTkl i lytcsggrnqqwi 
1 dg-»« ♦♦ + ♦*♦■» ■» l««c«*« q+W 

33945 532 ILQEDGSLFH BOSKKCVDAARKBSSDSFVPLUUXriNSDHOKHP 575 

pt<-* 

33945 576 FK 577 

// 

Searching for complete domains in SKART 

teagpCaa - search a sinole seg against KMM darabase 

HMMER 2.1.1 (Dec 1998r 

Copyright (C) 1992-1998 Washington University School of Medicine 

HMKER is freely distributed under the GNU General Puhlic License (GPL). 

HHM file: /dAa/robiaon/smart/nart/soart .all .hans 

Sequence file± /prod/d<kB/w8pace/orfanal/oa>aeript.39948.Beq 



Scores for sequence family classification Cacore includes all domains): 

Model Description Score E- value 



Parsed for domains: 

Hodel Domain seq-f seq-t Ina-f hara-t score B-value 



rlcin-3 1/1 445 .577 .. 1 135 M 73.3 5,le-lB 
Alignments of top-scoring domains: 

ricl4_Si domain 1 of 1, frcos 445 to 577: score 73.3. B • 5.1e-18 

* ->rgyf 1 i iggntglCLdvngnsesksdQnpvqlwdChg.ggnQlWklt 
+♦ g t* C d*n ♦e-» G ♦v+l+ Chg g nO ■» ■•t 
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„ p99.2 (2(0PAG T(3) //TRAKSFERASE N-ACCTYLGALACTOSAMINYLTRANSFERASE 
138 |k>LYFEFnDEACEIYLGAI^CT0SAMIhrn,11tANSF^^ 
GLYCOSYLTRANSFERASE PROTEIN-UDP PROTEIN- UDP N- 
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p99.2 (314) N0DC{8) PAGT(3) CAG2(3) // PROTEIN TRANSFERASE GLYCOSYLTRANSFERASE 
PUTATIVE BIOSYNTHESIS GLYCOSYL SYNTHASE TRANSMEMBRANE 
N-ACETYLGALACrOSAMINYLTRANSFERASE MEMBRANE 
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p99.2 (23) PAGT{3) // N-ACETYLGALACIt>SAMINYLTRANSFERASE TRANSFERASE 
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>2}0B p99.2 423) PACTOI // N-ACETYLGALKCT0SMIIliyLTItJtllSPEIU5E TRANSFERASE 
rOLYPEPTIoe ACerYLCAUCTOSMUIiyLTRMISPERASB UDP-GAUWCrPOLTPEPTXDE 
GLyCOSYLTUMISFERASE PROTEIN-UDP PROTEIN- UDP N- 
Lvngth - 172 

Scor* - 566 4204.3 bits). Expect « 3.3e-55, P » 3.3e-55 
Identities - 102/166 )6Hl, Positives - 128/166 177%) 

Query: 287 HMTVPERE»IW«S-PVDVIRSPTKAGGLrAVSiaWFEyUISym'GMEVWKEN^ 34S 

WH VPE ER R ♦ P D mSPTWAGGLFA-» ♦K+YFE LG+YD Qif «WGCIKI.E SFH 
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4WQO0C LE PCSKVCHVF K**PY« *■» N-»VR AEVtgMD««KE *t 

Sbjcts 62 VMOCtKRLEWPCSKVGHVFRKRSPYTFPGXOSGlOJVISRllTVRVABVHMDDyi^^ 121 
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>ei7 p99.2 I70> PACT (3) ABPB<2) E13Bt2) // SIGNAL HYDROLASE PRECURSOR TOXIN 
RRMA H-OLYCOSlDASe GLYCOSIDASE LECTIN REPEAT CLYCOPROTEIM 
140 



Score • 104 C41.7 bit*), Expect - 3.1e-05, P - 3.1e-05 
Identities - 29/128 (22%). positives - S5/128 <42t) 
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>26fll P99.2 (20) PAGTC3) // TRANSFERASE K-ACETYLGALACTOSAMINYLTRANSPERASE 
POLYPEPTIDE ACETYLGALACTOSAMIWYLTRANSFERXSE UDP-0ALMAC:POLYPBPTIDB 
OLYCOSYLTRAHSFERASB PROTEIN-UDP PROTEIN- UDP N- 
Length - 118 

Score - 99 (39.9 bita). Expect - 0.00011. P - 0.00011 
Xdentitles - 21/39 (53%), Poeitivea - 27/39 (69«) 

Query: 101 l^miYLSDRISLHRRLPBaiHPLCKEKXYD-YOMLPRT 138 

♦0 M»T SD ISL»R LPt CK +1^+ YCNLP T 

Sbjct: aO NQFNLYASDMISLNRSLPDVRPEECKTOKYNPYDNLPTT 116 
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GLYCOSVLTRANSFERASE PVTATIVC BIOSYNTHESIS GLYCOSYL SYNTHASE TRANSHEKBRANE 
N-ACETYLOAtACTOSAMINYLTRAHSFERASE HEHBRANE 
Mnoth • 196 

Score • 108 <43.1 bits). Expect > 0.0005«, 9 « 0.000S6 
KSenticies - 46/163 (28«), Positives « ''4/163 {*5%) 
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length «• 196 

Score • 74 431.1 bltst. Expect > 4.9. P - 0.99 
Identities • 23/80 (28%), Positives - 41/60 (Sit) 

Query: 108 SI»ISU{RRLPERNNPUnCEXKYOYI»ILPRTSVIIAPYNEAWSTL(JtTVYSVLETSPOIL 167 
S * R P«R PL K * DDL RTS * * ¥S*L * * *** DI« 
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